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1. Introduction. 

The number partitioning problem is a classical problem from combinatorial optimization. 
One considers N numbers Xi,...,x N and one seeks to partition the set {1,...,JV} into 
k disjoint subsets Ji, . . . , such that the sums Kp = Kp(Ii, ...,1k) = ^2 n€lfj x n are as 
similar to each other as possible. This problem can be cast into the language of mean field 
spin systems [Merl,Mer2,BFM] by realizing that the set of partitions is equivalent to the set 
of Potts spin variables a : {1, . . . , N} — > {1, . . . , k} N . We then define the variables 

N 

K p (a) = J2^nla n =(3,P=h...,k. (1.1) 

71=1 

One may introduce a "Hamiltonian" as [Merl,BFM] 

k-l 

H N (a) = ^\K p (a)-Kp +1 (a)\ (1.2) 

/9=1 

and study the minimization problem of this Hamiltonian. In particular, if the numbers Xj 
are considered as random variables, the problem transforms into the study of a random mean 
field spin model. For a detailed discussion we refer to the recent paper [BFM]. 

Mertens [Merl,Mer2] has argued that the problem is close to the so-called Random En- 
ergy Model (REM), i.e. that the random variables Kp(a) can effectively be considered as 
independent random variables for different realizations of a, at least as far as their extremal 
properties are concerned. This claim was proven rigorously in a paper by Borgs et al.[BCP] 
in the case k = 2 (see also [BCMP]). 

In this paper we extend this result to the case of arbitrary k and under the additional 
constraint that the cardinalities of the sets Ij are all equal. We formulate this result in the 
language of multi-dimensional extremal process. 

Let Xi, . . . ,Xn be independent uniformly distributed on [0,1] random variables. (We 
assume that N is always a multiple of A;.) Consider the state space of configurations a of 
N spins, where each spin takes k possible values a = (<7i, . . . , ajsr) £ {!,••■ , k} N . We will 
restrict ourselves to configurations such that the number of spins taking each value equals 
N/k, i.e. #{n : a n = (3} = N/k for all /3 = 1, . . . , k. Finally, we must take equivalence classes 
of these configurations: each class includes k\ configurations obtained by a permutation of 
the values of spins 1, . . . , k. We denote by the state space of these equivalence classes. 
Then 
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Each configuration a £ S^r corresponds to a partition of Xi,... ,X^ into k subsets of 
N/k random variables, each subset being {X n : a n = /?}, (3 = 1,... ,/c. Then the vector 
F(cr) = {^(o-)}^ with the coordinates 

N 

Y?{o) = K p (a) - K p+1 (a) = £ X n (I {(T „ =/3} - l {(T „ =/m} ), /? = 1, . . . , k - 1, (1.4) 

n=l 

measures the differences of the sums over the subsets. Our objective is to minimize its norm 
as most as possible. Our main result is the following theorem. 

Theorem 1.1: Let 

V p {&) = kT^{2irN)- 1 k^(kl)^2V6\Y (3 (a)\, (3 = 1,... ,k—l. (1.5) 
Then the point process on 

converges weakly to the Poisson point process on with intensity measure given by the 

Lebesgue measure. 

Clearly, from this result we can deduce extremal properties of H^(a) = Y%~=i l^ /? ( cr )l 
straightforwardly. 

Remark: Integer partitioning problem. It is very easy to derive also from our Theo- 
rem 1.1 the analogous result for the integer partitioning problem. Let Si, . . . , Sjy be discrete 
random variables uniformly distributed on {1, 2, . . . , M(N)} where M{N) > 1 is an integer 
number depending on N. Let us define 

N 

DP (a) = Y,Sn(K Wn =p } - 2 {<T „ =/3+1} ). 
n=i 

Theorem 1.2: Assume that M(N) — > oo as N -> oo such thatlim N ^ 00 (M(N))- 1 k N ^ k - 1 ^ = 
0. Let 

W p (a) = M(N)- 1 k^{2TrN)- 1 k^(k\)^2V6\D (3 (a)\, (3 = 1,... ,k—l. (1.6) 
Then the point process on 

^ $(W 1 (a),...,W k - 1 (<T)) 
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converges weakly to the Poisson point process on M + with the intensity measure which is 
the Lebesgue measure. 

Proof. It follows from Theorem 1.1 by the same coupling argument as in the proof of Theo- 
rem 6.4 of [BCP]. 

The difficulty one is confronted with when proving Theorem 1.1 is that the standard 
criteria for convergence for extremal processes to Poisson processes that go beyond the i.i.d. 
case either assume independence, stationarity, and some mixing conditions (see [LLR]), or 
exchangeability and a very strong form of asymptotic independence of the finite dimensional 
marginals [Gal,BM]. In the situation at hand, we certainly do not have independence, or 
stationarity, nor do we have exchangeability. Worse, also the asymptotic factorization of 
marginals does not hold uniformly in the form required e.g. in [BM]. 

What saves the day is, however, that the asymptotic factorization conditions hold on 
average on S^r, and that one can prove a general criterion for Poisson convergence that 
requires just that. 

Thus the proof of Theorem 1.1 involves two steps. In Section 2 we prove an abstract 
theorem that gives a criteria for the convergence of an extremal process to a Poisson process, 
and in Sections 3,4 we show that these are satisfied in the problem at hand. 

Unfortunately, and this makes the proof seriously tedious, for certain vectors cr,a', there 
appear very strong correlations between Y{a) and Y(a') that have to be dealt with. Such a 
problem did already appear in a milder form in the work of Borgs et al [BCP] for k = 2, but 
in the general case k > 2 the associated linear algebra problems get much more difficult. 

Remark: The unrestricted problem. These linear algebra problems prevented us to 
complete the study of the unrestricted problem (that is when the sets I\ , . . . , Ik are not 
necessarily of size N/k) in the case k > 2. In Section 5 we give a conjecture for the result 
similar to Theorem 1.1 in this case and explain the drawback in the proof that remains to 
be filled in. 

Remark: Dynamical search algorithms. It would be interesting to investigate rigorously 
the properties of dynamical search algorithms, resp. Glauber dynamics associated to this 
model. This problem has been studied mainly numerically in a recent paper by Junier and 
Kurchan [JK]. They argued that the dynamics for long times should be described by an 
effective trap model, just as in the case of the Random Energy Model. This is clearly going 
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to be the case if the particular updating rules used in [BBG1], [BBG2] for the REM will 
be employed, namely if the transition probability p(o~,o~') depends only on the energy of the 
initial configuration. In the REM this choice could be partly justified by the observation 
that the deep traps had energies of the order — N, while all of their neighbors, typically, 
would have energies of the order of 1, give or take y/\nN. Thus, whatever the choice of the 
dynamics, the main obstacle to motion will always be the first step away from a deep well. 

In the number partitioning problem, the situation is quite different. Let us only consider 
the case k = 2. If a is one of the very deep wells, then 

N 

H N (a) = | Xi<Ti\ « 2~ N VN. (1.7) 

i=l 

If <r J denotes the configuration obtained from a by inverting one spin, then 

H N (a j ) ~ 2\ Xj \. (1.8) 

For a typical sample of x^s, these values range from 0(1/N) to 1 — 0(1/N). Thus, if we 
use e.g. the Metropolis updating rule, then the probability of a step from a to a- 7 will be 
~ exp(— 2/3|xj|). It is by no means clear how high the saddle point between two deep wells 
will be, and whether they will all be of the same order. This implies that the actual time 
scale for transition times between deep wells is not obvious, nor it is clear what the trap 
model describing the long term dynamics would have to be. 

Of course, changing the Hamiltonian from H(a) to In H(a), as was proposed in [JK], 
changes the foregoing discussion completely and brings us back to the more REM-like situa- 
tion. 

Acknowledgements: We thank Stephan Mertens for introducing us to the number parti- 
tioning problem and for valuable discussions. 

2. A general extreme value theorem. 

Consider series of M random vectors V iM = (V± M , ... , V^) e M. p + , i = 1, . . . , M. 

Notation. We write J2 a (i) wnen the sum is taken over all possible ordered sequences of 
different indices {i\,... ,ii} C {1, . . . ,M}. We also write J2 a ( ri ) a(r H )(0 wnen the sum 
is taken over all possible ordered sequences of disjoint ordered subsets a(r\) = (ii, . . . ,i ri ), 
a(r 2 ) = (v 1+ i,... ,V 2 ),... , a(r R ) = (i ri+ ... +rR _ 1+1 , . . . ,i ri+ ... +rR ) of {1,... ,M}. 
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Theorem 2.1: Assume that for all finite I = 1,2,... and all set of constants J 3 - > 0, 
j = 1, . . . , I, (5 = 1, . . . ,p we have 

E v(vl M <c?Vj = l,...,l,P=l,...,p)^ J] cf, M^oo. (2.1) 
Then the point process 



a (i)=(h,...,h) r!-'' 

p— i, . - . ,p 



M 

n M = E^ M ,..^ M ) (2-2) 
i=i 

on IR+ converges weakly as M — ► oo to i/ie Poisson point process V p on R!J_ u>ii/i £/ie intensity 
measure which is the Lebesgue measure. 

Proof. Denote by 11^(^4) the number of points of the process IT^ in a subset A C R+. 

The proof of this theorem follows from Kallenberg theorem [Kal] on the week convergence 
of a point process 11^- to the Poisson process IP. Applying his theorem in our situation weak 
convergence holds whenever 

(i) For all cubes A = Y§ =1 [aP , V) 

EU P M (A)^\A\, M^oo. (2.3) 

(ii) For all finite union A = [jf =1 Y\$=i [ a n ) °f disjoint cubes 

¥(U P M (A) =0)^e-l A l, M^oo. (2.4) 

Our main tool of checking (i) and (ii) is the inclusion-exclusion principle which can be 
summarized as follows: for any / = 1, 2, . . . and any events 0\, . . . ,Oi 

l k 

K n °0=£ e mm n^) (2.5) 

i=l,...,l k=0 ^ fc =(n,... ,i fc )c{i,... ,0 j=l 

»l<»2<--< i /s 

where Oi j are complementary events to O^. . We use (2.5) to "invert" the inequalities of 
type {Vf M > a 13 }, i.e. to represent their probability as the sum of probabilities of opposite 
events, that can be estimated by (2.1). The power of the inclusion-exclusion principle comes 
from the fact that the partial sums of the right-hand side provide upper and lower bounds 
(Bonferroni inequalities, see [Fe]), i.e. for any n < [1/2]: 

2n k 2n+l k 

e e (-i) fc p(n^)^ p ( n^E e (-i) fe F(n^,). 



fc=0 A k = (i 1 ,... ,i k ) j = l i=l,..., I k=0 A k = (i 1 ,... ,i k ) j = l 

C{l,...,i} C{l,...,i} 



(2.6) 
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They imply that it will be enough to compute the limits as N f oo of terms for any fixed 
value of I. Using (2.5), we derive from the assumption of the theorem the following more 
general statement: Let A\,... ,Ai G be any subsets of volumes \A\\, . . . ,\Ai\ that can be 
represented as unions of disjoint cubes. Then for any mi, ... , m/ 

i 

nVi,M G Aj Vi G a(m r ),Vr = 1, . . . ,Z) ^ [J l^p-. (2.7) 

a(mi),a(m2),...,a(mi) r=l 

Let us first concentrate on the proof of this statement. We first show it in the case of one 
subset, I = 1, which is a cube A = n^=i[ a ' 3 ' Let m = 1. We denote by Y1ac{i p} ^ e 
sum over all 2 P possible ordered subsets of coordinates : A denotes the subset of coordinates 
(3 such that the inequalities Vf M < a 13 are excluded leaving thus Vf M < hP . Then by (2.5) 
applied to fl^ =1 {<M > ^} 

M M 

i=l i=l 

M 

= E E (-1) | - 4| P(^m<^^ + ^W>V/3=1,...,p) (2.8) 

i=l -AC{l,...,p} 

M 

= E (- 1 )'" 4 ' E P (<m < + bPfyeAM = 1, • • • ,P)- 

_4c{l,...,p} i=l 

The interior sum in (2.8) X)i=i ^(') converges to 113=1 (°^I/3^ + ^I/3e^) by the assumption 
(2.1). Thus 

M p p 

Jim £P(? iiAf €A)= ^ (-l)I^JJ(o%0>i + ^ e >i) = II(6 /3 -o /3 ) = ^l- 

i=l ^C{l,...,p} /3=1 /3=1 

(2.9) 

Now let m > 1. Denote by YIai ^ ^ m the sum over all 2 mp ordered sequences of all 2 P 
unordered subsets A C {1, . . . ,p}. Here Aj is the subset of coordinates corresponding to the 
jth index in the row a(m) = (ii, . . . , i m ). Then by (2.5) 

J2 V(Vi, M G A Vi G a(m)) = ^ P ((/ < lf M < b 13 \/i £ a(m), V/3 = 1, . . . ,p) 

a(m) a(m) 

= E E (-i)^ il+H ^^ 

a(m) A,- ,^tm 

= Yl (-l)^ 1 '^"^™ 1 ^ P(^ M < a^I^. + 6% e ^Vi = i,-€a(m),Vj = l,... ,m, 

Ai,... ,Am a(m) 

(2.10) 
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By (2.1) applied to the interior sum of (2.10) Yl a (m) ^(") we S e ^ : 



lim^ Y, nV*,M G AVi e a(m)) = E (_l)IA|+-|^l TJ (a%^. +6%^, ) = 

a(m) Ai,...,A m ,j = l/3=l 



Assume now that / > 1 and A r = rTg=i[ a f > r = 1, . . . , I. Then 
X] P(y iiM €i4 3 -Vi€a(m r ),Vr = l,... ,i) 

a(m 1 ),a(m 2 ),... ,a(mi) 

E E (-i) | - 41i+ - +| - 4L ^«m<« /3 ^ + ^i^ 

a(m 1 ),a(m 2 ),... ,a(mi) A\,... ,A\ ni , 

a^ 

• ■ • ■ '"^i ' ■ ' ■ ' mi 

V« = ij G a(m r ),Vj = 1, . . . , m r , Vr = 1, . . . , Z, V/3) 

^,■■■,^1,, a(mi),a(m 2 ),...,a(mi) 

F (^m < a%*Aj +6 /3 I/3€^rVi = i 3 - Ga(m r ), Vj = 1, . . . ,m r ,Vr = l,... ,Z,V/3 

(2.11) 

Due to (2.1) applied once more to the interior sum X)a(mi) a (m,) F (')> (2-11) converges 

to 

^i.-.^ -^-^ r=lj=l/J=l 

= (_1)I^I + - + IA 1 „ 1 I ... ^ 

w-i w-i 

I — 1 m r p 
r=l j = l 0=1 

= iAii mi |A2r 9 ---i^r'. (2.12) 

Let finally A x = \J s k=1 A ltk , . . . ,A L = [j k=1 A t;k be unions of si,... ,si disjoint cubes 
respectively. Then we may write: 

E W(Vi,M € Aj Vt G a(m r ),Vr = 1, ... ,1) 

a(mi),a(m 2 ),... ,a(m;) 

E ••• E E (2.i3) 

m i,i >■ ■ • • m i,s 1 >° m !, Si >° <*("n,i) 

"H,lH h">l,i]=">l m I,lH l" m !,s,= m i ■•■ .«("»(, l) "('"i.s,) 

IP (Xm G -4r,fcVi G a(m rjfc ) Vr = 1, . . . , Z, V/c = 1, . . . , s r ) 
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and apply to the interior sum Yl a ( mi i) a (mi ) ^(') * ne statement (2.7) about cubes just 
proven by (2.12). Then (2.13) converges to 

E ■•• E nfti^r— II 2 ft i^i— 

mi )lt ... ,m ljSl >0 m l ^ 1 ,... ,m lfSi >0 r =l k=l r = l rn r ,1 > • ■ * . m r, s r > fc = l 
m l,lH l" m l,s 1 =m l j^H j-mj 5 > =m Z rn r,l~\ h^r,s r -^r 



(2.14) 

This finishes the proof of the statement (2.7). 

Now we are ready to turn to the proof of the theorem. The condition (i) has been already 
shown by (2.9). To verify (ii), let us construct a cube B = Y\$=i [0> m &x-i=i,... ,l bf) of volume 
\B\, then clearly A C B. For any R > we may write the following decomposition: 

R 1 

F(U M (A) = 0) = ~ { Y P (^> M ^ B \A\/i G a(r), 5 Vi a(r)) 



r=0 a(r) 

+ P(n M (^4) =0,U M {B) >R) =h(R, M) +I 2 (R,M). 



(2.15) 



Applying the inclusion-exclusion (2.6) principle to M — r events {Vi G" B} for i G" a(r), we 
may bound h(R, M) for all n < [(M — r)/2] by 

Y^Y^T^ Y ¥{ y iM eB\AViea(r), V iM € B Vi € a(k)) > h(R, M) 



rl ^ k\ 

r=0 k = a{r),a(k) 
R 2n+l /-.Nfc 

^Y^Y~JT Y nV itM €B\AVi€a(r),V itM €BVi€a(k)). 

r=0 ' fc=0 ' a(r)a(k) 

(2.16) 

Then for any fixed n > 1, the statement (2.7) applied to the subsets A/B and B imply: 

£ m*L V '-^l" > lim 7 lW M) > f M £ ( - 1) :,' B|t . (2.17) 

r=0 k=0 r=0 k=0 

Since n can be fixed arbitrarily large, it follows that 

lim l 1 (fl,M)=e-l fi iy |5 \ i|r . (2.18) 

r=0 



The statement (2.7) also gives 



IB 



lim I 2 (B, M) < lim P(nj M ( J B) > B) = lim i- V P(*U G B Vi G a(B)) 

M^oo M^oo M^oo ft! it! 

a(fl) 



(2.19) 
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By choosing R large enough, the limit (2.19) can be done as small as desired and the sum 
(2.18) can be done as close to the exponent e' B \ A '~' B as wanted. Hence, liniM^oo P(n]^ (A)) = 
e~' A L This concludes the proof of the theorem. 

3. Application to number partitioning 

We will now prove Theorem 1.1. In fact, the proof will follow directly from Theorem 2.1 
and the following proposition: 

Proposition 3.1: Let 

S(k,N) =k N {2irN)^k%(kl)- 1 (3.1) 

be borrowed from (1.3). We denote by (•) the sum over all possible ordered 

sequences of different elements o/Ejv- Then for any I = 1,2,... , any constants Cj > 0, 
j = 1, . . . , I, (5 = 1, . . . , k — 1 we have: 

- J] (2(2vr)" 1 / 2 cf). (3.2) 

3 = 1,... ,1 
(3 = 1,... ,k-l 



Informal arguments. Before proceeding with the rigorous proof, let us give intuitive ar- 
guments supporting this lemma. 

The random variables , Y = are the sums of independent identically distributed 

^2{N/k)vax x 

random variables with the expectations EY^cr- 7 ) = and the covariance matrix B^(a 1 , ... , a ) 
with the elements 

Pn _ cov (yy),y>*)) _ Eli(i{<=/3 } - g {<=/ 3+i } )(g {<=7} - 1 {<=7+ 1}) 

^ s 2{N/k)vsxX 2{N/k) ' { ' ' 

In particular: 

6jif = l, ^f +1 = -l/2, = for 7 ^ + 1, Vt = l,... ,fc-l. (3.4) 

Moreover, the property that ftf^ 7 = o(l) as iV — > oo for all i ^ j, (3, 7, holds for a number 
R(N, I) of sets a 1 ,... ,a l G S®' which is i?(iV, Z) = |Ejv|'(1 + o(l)) = 5(fc, iV)'(l + o(l)) with 
o(l) exponentially small as N — > 00. For all such sets a 1 , . . . , a 1 , by the Central Limit Theo- 
rem, the random variables , Y = = should behave asymptotically as centered Gaussian 

^/2(v/fc)varx J ^ J 
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random variables with covariances b^'J = l{j =J - i/ g =7 } + (— l/2)l{ i=J)7=) 3+i} +o(l). The deter- 
minant of this covariance matrix is 1 + o(l). Hence, the probability P(-) defined in (3.2) that 
these Gaussians belong to the exponentially small segments 

[—c?S(k, N)~ 1 / ( - k ~ 1 \c^S(k, A/") _1 /( fc_1 )] isoftheorder J] ^tt)" 1 / 2 ^ iV)-V(fc-i)). 

3 = 1,... ,i 
13 = 1,... ,k-l 

Multiplying this probability by the number of terms R(N, I) we get the result claimed in (3.2). 

Let us turn to the remaining tiny part of E®' where a 1 , . . . ,a l are such that 6^' 7 -/-> 
for some i 7^ j as N — ► 00. Here two possibilities should be considered differently. The first 

one is when the covariance matrix BnIg 1 , ... , a 1 ) of . y ^ a = is non-degenerate. Then 

iVV ' ' ; ^/2(Af/fc)varx & 

invoking again the Central Limit Theorem, the probability P(-) in this case is of the order 
(detB N (a\... ,a'))- 1/2 (2(2^)" 1/2 cf 5(fc, iV)" 1 /^-!)). 

3 = 1,... ,1 
13=1,... ,k-l 

But from the definition of b^'J (detBj^^a 1 , . . . , cr')) -1 / 2 may grow at most polynomially. 
Thus the probability P(-) is about S(k,N)~ l up to a polynomial term while the number of 
sets a 1 ,... ,a l in this part is exponentially smaller than S(k,N) 1 . Hence, the contribution 
of all such a 1 ,... ,a l in (3.2) is exponentially small. 

The case of a 1 , . . . ,a l with B{a x , . . . , a 1 ) degenerate is more delicate. Although the num- 
ber of such a 1 , . . . , a 1 is exponentially smaller than S(k, N) 1 , the probability P(-) is exponen- 
tially bigger than S(k, N)~ l since the system of l{k — 1) random variables \Y (c r *)} ( g = i ' k—i 
is linearly dependent! First of all, it may happen that there exist 1 < i\ < i<i < ■ ■ ■ < i p < I 
such that the basis of this system consists of (k — l)p elements {Y^{a l3 ')}^~\' "' ' p k _ 1 . Then 
the assumption that the elements a 1 , . . . ,a l of S^r must be different, plays a crucial role: 
due to it the number of such sets a 1 , . . . ,a l in this sum remains small enough compare to 
the probability P(-) 5 consequently their total contribution to (3.2) vanishes. 

Finally, for some sets a 1 , . . . ,a l , there is no such p < I: for any basis, there exists a number 
j G {1, ... ,1} such that the random variables Y^cr 5 ) are included in the basis for some non- 
empty subset of coordinates and are not included there for the complementary non-empty 
subset of 0. This last part is clearly absent in the case k = 2. It turns out that its analysis 
is quite tedious. We manage to complete it only in the case of the constrained problem by 
evaluating the number of such sets a 1 ,... ,a l where each of spins' values {1, . . . , k} figures 
out exactly N/k times and by showing that the corresponding probabilities P(-) are negligible 
compare to this number. The only drawback that remains in the study of the unconstrained 
problem is precisely the analysis of this part. 
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Proof of Proposition 3.1. In the course of the proof we will rely on four lemmata that will be 
stated here but proven separately in Section 4. Let 

/£'" V ({fo})=Eexp( J E ^V)) (3.5) 

3=1,... 

be the characteristic function of the random vector (2(N/k)vai X)~ 1 ^ 2 {Y ,3 (a : ')} j=i,...,i, . 

P—l, ... ,k — 1 

Here i = {t/3,j} ,3=1,... ,k-i, is the vector with (A; — 1)/ coordinates. Then 

' 3 = 1,... ,1 

P(V/3=1,... ,jfe-l,Vj = l,... B^=L= < (3.6) 

V y/2{N/k)vsiX S(k,N)T&' 

1 fit e «/9,j<f fif(fc,JV)^r _ e -itf,jc$S(k,N)tt 

= (2^t y ^ ({ ^ }) n ; ^- 

[-D,D]'<fc-i) pU]:::',k-i 

It will be convenient to have in mind the following representation throughout the proof. Any 
configuration a gives rise to k — 1 configurations c/ 1 ), . . . ,cr( fc_1 ) G { — 1,0, 1}^ such that 

^=lK=«-lK=W}. n = l,...,iV. (3.7) 

We now define the N x (k — 1) matrix C(a) composed of columns . . . , a^- k ~ l \ Then it is 
composed of types of k rows of length k — 1: Oq = (1, 0, . . . , 0), Oi = (— 1, 1, . . . , 0), O2 = 
(0, -1, 1, 0, . . . , 0), . . . , O fc _ 2 = (0, . . . , 0, -1, 1), O k _ x = (0, . . . , 0, -1). They correspond to 
spin values 1, 2, . . . , k respectively: if a n = (3, then the nth row of C(a) is 0,3-1. 4 Each of 
these k rows is repeated N/k times in the construction of C(a). Then 

71=1 

Let C(a 1 , ... , ct') be the N x (fc — 1)/ matrix composed by the columns 
0-1,(1)^1,(2)^ _ _ )Cr i,(fc-i) )0 -2,(i) ; )Cr *,( fc -i). Then it is easy to see that the function 

In ' "' CT ({*/3,j}) i s product of iV functions 

N 

"(fa J ))-n^ X p( w ;, arX ic^......-) t )..) 

IJi iivw^i-H* 1 ,... y)t). ' 



4 The case fc = 2 is particular, since here C(cr) is the vector with elements ±1; i.e. in this case this 
reparametrisation just corresponds to passing from values {1,2} to { — 1, +!}■ 
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where {(^((T 1 , . . . , <J l )t} n is the nth coordinate of the product of the vector t = {tpj} p=i,... ,k-i, 
with the matrix C(a 1 , ... , a 1 ). 

We will split the sum of (3.2) into two terms 

E p (-)= E E ( 3 - 9 ) 

ranker 1 ,... , CT «)=(fc-i)i ranker 1 ^xifc-iji 

and show that the first term converges to the right-hand side of (3.2) while the second term 
converges to zero. 

We start with the second term in (3.9) that we split into two parts 

E no = Jh + 4 

rankcv 1 ,... ,o-*)<(fc-i)i 

In the first part the sum is taken over ordered sets a 1 ,... ,a l of different elements of Sjv 
with the following property: the rank r of C{a l , ... , a 1 ) is a multiple of — 1) and, moreover, 
there exist configurations a 11 ,... J a lr / ( - k - 1 '> such that all of ,l1 , ' ll , . . . }< j( k ~ 1 ' > ' lr / < - k - 1 '> 
constitute the basis of the columns of the matrix C(a 1 , . . . , a 1 ), i.e. the rank of C(a %1 , . . . , cr v /<' £ - 1 ) ) 
equals r. Consequently, for any j G {1, . . . , Z} \ {«i, . . . , V/(fe-i)} all of a^'i , . . . , cr^ -1 )'- 7 
are linear combinations of the columns of the matrix C(cr n , . . . , ). In the remaining 

part, Jjy, the sum is taken over configurations a 1 , a 2 ,... , a' satisfying the complementary 
property: for any basis of the columns of C(a 1 , ... , a 1 ) there exist at least one configuration 
o % such that some of the configurations a^' 1 , . . . , cr^ -1 )' 1 are included in this basis and some 
others are not 5 . 

The following Lemma 3.2 shows that the sum is taken over sets of different a 1 ,. . . ,a l 
such that the matrix of the basis C(a 11 , . . . , cr v /(fc-i) ) contains at most [k r l ( - k ~ 1 '> — 1) different 
rows. 

Lemma 3.2: Assume that the matrix C(a 1 , . . . , a 1 ) contains all k l different rows. Assume 
that a configuration a is such that each linear combination of the columns 

of the matrix C(a 1 , ... , a 1 ). Then the configuration a is obtained by a permutation of spin 
values in one of the configurations a 1 ,... ,a l , i.e. a coincides with one of a 1 ,... ,a l as an 
element of Ejy. 

5 In the case k = 2 the term can obviously not exist. This leads to considerable simplifications. 



(3.10) 
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Remark: In the case k = 2, Lemma 3.2 has been an important ingredient in the analysis of 
the Hopfield model. It possibly appeared first in a paper by Koch and Piasko [KP]. 

In fact, if in Jjj the matrix C(<7 n , . . . , a v /( fc - 1 )) contained all /c r /( fc_1 ) different rows, then 
by Lemma 3.2 the remaining configurations 0" J with j £ {1, ... , 1} \ . . . , i r /(k-i)} would 
be equal to one of a 11 , . . . , a lr / ( - k - 1 '> as elements of S^r, which is impossible since the sum in 
(3.10) is taken over different elements of Eat. Thus there can be at most 0{(k r ^ k ~^ - l) N ) 
possibilities to construct C(o~ 11 ,... , a 1 *-/^- 1 ) ) in the sum J^. Furthermore, there is only a 
TV-independent number of possibilities to complete it by linear configurations of its columns 
up to C(a 1 , . . . ,a l ). To see this, assume that there are v < /c r /( fc_1 ) different rows in the 
matrix C(a %1 , . . . , cj v /( fc - 1 ) ) and consider its restriction to these rows which is the uxr matrix 
C(a h ,... Then C(a h , . . . , o^i^-v ) has the same rank r as C(a h ,.. . , a^/^-v). 

Now there are not more than 3( !/ W c_1 ) _r )) ways to complete the matrix C to a v x \{k — 1) 
matrix with elements 1, —1, such that all added columns of length v are linear combinations 
of those of C. But each such choice determines uniquely the coefficients in these linear 
combinations, and hence the completion of the full N x r matrix C(<r n , . . . ,a lr /< k -v) up to 
the N xl(k — 1) matrix C(a 1 , ... , a 1 ) is already fully determined. Thus the number of terms 
in the sum representing is smaller than 

k r/(k-l)_ 1 

v * 3 (»(Kk-l)-r)) = (( k r/{k-l) _ (3 n) 

The next proposition gives an a priori estimate for each of these terms. 

Lemma 3.3: There exists a constant K(k, I) > such that for any different a 1 , . . . ,a l G 
Sat, any r = rank C(a 1 , . . . a 1 ) < (k — 1)1 and all N > 1 

P(V/? = 1,... ,fc-l,Vj = l,... ,1 - = < ^—T~) <KS(k,N)- r / ( - k -^N 3r / 2 . 

(3.12) 

Hence, by Lemma 3.3 each term in is smaller than KS{k, N)~ r ^ k ~^ N 3r / 2 with the 
leading exponential term /c-^/^- 1 ). It follows that = 0([(k r ^ k -^ - l)k- r ^ k -^] N ) ^0 
as N — ► oo. 

Let us now turn to in (3.10). The next proposition allows to evaluate the number of 
terms in this sum. 

Lemma 3.4: Let be any N x q matrix of rank r < q. Assume that for any N > 1 
it is composed only of R different rows taken from a finite set V of cardinality R > k. 
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Let Qpf(R,t) be the number of configurations a such that the matrix Dpj completed by the 
columns o-^\ . . . , cj( fc_1 ) has rank r + t where 1 < t < k — 2. Then there exists a constant 
K(R,t,k) > 0, depending only on R,t,k, such that 

Qn(R, t) < K(R, t, fc) ^|±i|Zg! , (3.13) 



Now, to treat Jj^, consider a 1 , . . . ,a l such that (k — l)m + t\ + £2 + ■ ■ ■ + ts = r columns 
of C(o" 1 , ... , a 1 ) form a basis for the span of all column vectors of this matrix. Then there 
exist a 11 , . . . , a %m such that all of a^' 1 " are included in the basis for all v = 1, . . . , k — 1, 
p = 1, . . . , m, and there exist <r jl , . . . , £j Js such that among a^^ q t q > 1 configurations are 
included in the basis and other k — 1 — t q > 1 are not, q = 1, . . . , s. By Lemma 3.4 the 
number of possibilities to construct such a matrix C(a 1 ,... , a 1 ) is 



/ umN TT ( N ( t q + 1 )/ k )- \ _ . jNm TT f/ , i \JV(t, + l)/fc 

v fc JUL ((jv/fc) !)*«+! / 11^ + ^ 



s=l 



up to leading exponential order. The probability in (3.9) is already estimated in Lemma 3.3: 
it is 

0(N 3r/2 S(k, N)~ r/( - k ~ 1) ) ~ k~ Nr/{k ~ 1) = fc- Ar ( m ( fc - 1 )+*i+*2+---+t 3 )/(fc-i) > 

Thus, to conclude that Jfj — > exponentially fast, it suffices to show that for any fc = 3, 4, . . . 
and any t = 1,2,... , A; - 2 we have (t + l)(*+ 1 )/ fc /c-*/( fc - 1 ) < 1, which is reduced to the 
inequality 

(f,(k,t) = ^^ln(t+l)-^-lnfc<0. 

It is elementary to check that ^^'^ < for all k > t + 1 and t > 1. Then, given t, it suffices 
to check this inequality for the smallest value of k which is k = t + 2, that is that 

%l>{k) = (k— l) 2 ln{k - 1) - k(k - 2) In jfe < 0. 

This is easy as ip'(k) < for all > 3 and ip(3) < 0. Hence, — > as iV — > oo. Thus the 
proof of the convergence to zero of the second term of (3.9) is complete. 

We now concentrate on the convergence of the first term of (3.9). Let us fix any a G (0, 1/2) 
and introduce a subset TZf N C £® z : 

Kn = {°\--- V e Ejv :Vl<i<r<U<0,7,»7<M^7 
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N 

| E(I{ai=fl " ^ n =,})MK=v}\ < ^ +1/2 }. (3-14) 

n=l 

This subset can be constructed as follows. Take a 1 where each of k possible values of spins 
is present N/k times. Divide each set Ap = {i € {1,...,N} : o\ = ft}, ft € {l,...,fe}, 
into N/k + 0(N a+1 / 2 ) pieces Ap tl of length N/k 2 + 0{N a+1 / 2 ). Then the spins of a 2 have 
the same value on the subsets of indices which are composed by k such pieces Ap a taken 
from different Ap, ft = 1,... ,k. Next, divide k 2 subsets Ap a into k pieces Ap tlt s- The 
spins of a 3 have the same values on the subsets composed by k 2 such pieces Ap tl ,s of length 
N/k 3 + 0{N a+1 / 2 ) taken from different Ap n , etc. It is an easy combinatorial computation 
to check that with some constant h > 

\ n% N \ < k Nl exp(-hN 2a ) (3.15) 

from where by (1.3) 

\Kf ;N \ = S(k,N) l (l + o(l)), iV^oo. (3.16) 

It is also not difficult to see that for any a 1 ,... ,a l 6 Tlf N the rank of C(a 1 , . . . ,a l ) equals 
(k — 1)1. Note that the covariance matrix (see (3.3)) can be expressed as 

Bn{(J ' • • • ' a ] = 2(iV/A:)varX ' (3 " 17) 

Thus by definition of TZ^ t , its elements satisfy 

b?? = 0(N a - 1 ' 2 ) Vft,7,i^j, (3.18) 

uniformly for Vcr 1 , . . . ,a l € IZ.% v Therefore, for any a 1 ,. . . , a 1 € Tlf n , deti?Ar(cr 1 , ... ,a l ) = 
1 + o(l) and consequently the rank of C^cr 1 , ... ,cr') equals (k — 1)1. 

By Lemma 3.3 and the estimate (3.16) 

J2 P(0 < k Nl e- hN " a KN !t V'- 1 W 2 S(k,N)- 1 0. (3.19) 

rankcv 1 ,... ,<r ! )=(fc-i)z 
To complete the study of the first term of (3.9), let us show that 



P(.) - (2vr)-( fc - 1 )'/ 2 H (2<£) 



(3.20) 
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with P(-) defined by (3.6). Using the representation (3.6), will divide the normalized proba- 
bility P(-) of (3.6) into five parts 

S(k,N) l ( J] (2cf)- 1 )p(-) = ^4(c7 1 ,... ,a l ) (3.21) 

i=i,...,fc-i, i=\ 

3 = 1,... ,1 

where: 

ee (27r)-'( fc - 1 ) / e-^^ 1 ,..,^)^ -Q apj, (3.22) 

_ (3=1,... ,fc-l, 

\\i\\<eN 1 /^ 3 = 1,-.., I 



and 



4 = (2vr)-^- 1 ) / (/^'■■- i ({^})-e-^^'-- i )*/ 2 ) J] ( 3 - 23 ) 

_ /3=i, •■■ ,fc-i, 

||t||<eATi/6 3 = 1,...,! 

4 = ( 2 ^)-^- 1 ) / ./a- - 1 ({/,,}) n ^ ( 3 - 24 ) 

, 0=1,... ,k-l, 

eA rl / 6 <||tl|<<5v / lV 3=1, .-.,! 

4 ^r^- 1 ) | ./-' t ' si/,,}) 

||tl|<5%/jV 

3=1,... ,! 3 = 1,...,! 

(3.25) 

J^EE(2 7 r)-'( fc - 1 ) lim / /.v ^ *!/,,}) 

[-D,D]i(fc-i)n||tl|>5v/iV 



3 = 1,...,! 



=1,... ,k-l - • >,J~J 

for values 5, e > to be chosen appropriately later. We will show that there is a choice such 
that Iifia 1 ,... ,a l ) -► for i = 2,3,4,5 and /^(cr 1 ,... ,a z ) -► ^vr)"^" 1 )'/ 2 , uniformly for 
a 1 , ... ,0"' £ ^-"^r as ~~ * 00 ■ These facts combined with (3.16) imply the assertion (3.20) 
and complete the proof of the proposition. The following lemma gives control over some of 
the terms appearing above. 

Lemma 3.5: There exist constants C > 0, e > 0, 5 > 0, and ( > 0, such that for all 
a 1 , . . . ,a £ T^ftf, the following estimates hold: 

(i) 

\fN'-' a \{t^})-e- rBNial — al)m \ < 9^ e -tB N (a\...^)t/2^ forallWtWKeN 1 / 6 

V N 

(3.27) 
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(ii) 

and 

(Mi/ 1 

\fN'-' a \{tfS,j})\ < e~ <mi2 for all \\t\\ < 5VN. (3.29) 



We can now estimate the terms P N . First, by a standard estimate on Gaussian integrals, 



, a 1 ) = ((2^ k -^ l detB N (a\. . . , a 1 ))^ 2 + o(l) 
= (2vr)-( fc - 1 ^ 2 +o(l), iV^oo, 



(3.30) 



where o(l) is uniform for a 1 , . . . ,a l G 72."^ by (3.18) and (3.4). Thus Ijj gives the desired 
main contribution. 

The second part /^(u 1 ,... ,a l ) = 0(N~ 1 ^ 2 ), uniformly for a 1 ,... ,a l G Tlf N by the 
estimates (3.27) and (3.18), (3.4). The third part /^(cr 1 ,... ,0"') is exponentially small by 
(3.29). To treat /^(cr 1 , ... , a ), we note that for any e > one can find Nq such that for 
all N > and all t with \\i\\ < 5^/~N the quantity in square brackets is smaller than e in 
absolute value, and apply again (3.29). Finally, we estimate 



I%(a\...* l )\<(2«)-^-V f |/£--* , ({t /9J })| [J dt^. 

^ — 3 = 1,... ,i, 



(3.31) 



|t||>5-v/lV 



3=i,... ,fc-i 



For any u 1 , . . . , a 1 G 7£f N the matrix C^cr 1 , . . . , cr') contains all k l possible different rows and 
by (3.8) f N {{tf3j}) is the product of k different characteristic functions, where each 
is taken to the power N/k l (l + o(l)). Let us fix from a set of k l rows of C(a 1 , ... ,a l ) any 
(k — 1)1 linearly independent and denote by C the matrix composed by them. There exists 
rj(S) > such that yj tC T Ct/ (2(1 / k)v&r X) > rj for all F with > 5. Changing variables 
s = tC T I ^/2(N/k)v&rX one gets the bound 



l r5 IrJ- 



,o- 



<(27r)-'( fe - 1 )(2(iV/A;)varX) z ( fc - 1 )/ 2 (det C) 



n 



g=i,...,k-i, 

\\ S \\>V j = i,...,i 



Nk- l (l+o(l)) 



< CN l(k-l)/2^ _ /i(r?)) 7Vfc- i (l+o(l))-2 



/ n 

jf. 13=1,... ,k-1, 

\S\\>V 3 = 1,..., I 



ds/3j, 
(3.32) 
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where h(rj) > is chosen such that \{e ls — l)/s\ < 1 — h(rj) for all s with |s| > r//((fe — 1)1) 
and C is a constant independent of the set a 1 ,... ,a l and N. Thus /^(cr 1 ,... ,a l ) — ► 0, 
uniformly for a 1 , . . . , a' G and exponentially fast as iV — > oo. This concludes the proof 

of (3.20) and of Proposition 3.1. 

4. Proofs of Lemmas 3.2, 3.3, 3.4, 3.5. 

Proof of Lemma 3.2. Let first 1 = 1. Without loss of generality we may assume that the first 
fe rows of C(a 1 ) are different. Then for all i = 1, . . . , k — 1, the following system of equations 
has a solution: 

A i — CT i 
-A« + A? = 



\W i \W _ =.(*) 

_A fc-2 + A fc-1 - 

-A?li = *J°- (4-1) 

Then necessarily 2^n=i ^™ ' = for alH = 1, 2, . . . , k — 1, since the sum of the left-hand sides 
of these equations equals 0. But for at least one j €. {1, . . . , k} and i = 1, . . . , fe — 1, crj*' ) / 0, 
for otherwise A^ = for all s = 1, . . . , k i = 1, . . . , fe — 1 and consequently C(a) is composed 
only of zeros, which is impossible. Without loss of generality (by definition of S^r we may 
always permute spin values) we may assume that / 0. 

We will use the following crucial property of the configurations a^, . . . , d^ k ~^: 

W = 1 => = 0, 4 J+2) = <>,..., = 0. (4.2) 

<#> = -1 => a^ 1 ) = 1, ^+ 2 ) = 0, . . . , 4 fc - 1} = 0. (4.3) 



It follows that, for a certain number t\ > 1 of pairs of indices nj, raf , ... ,71^,71^ G {1, . . . , k}, 
we must have that cr^V = 1 and cr^V = — 1, w = 1, . . . ,t\. We say that these 2t\ indices are 
"occupied" from the step j = 1 on, since, by (4.2) and (4.3), we know all values a^} = for 
all j = 2, 3, . . . , k - 1, = 1, and a$ = for all j = 3, . . . , fe - 1, u = 1, 2, . . . ,t x . We 

it -u, 

say that the other fe — 2t\ indices are "free" at step j = 1. Then we must attribute to at 
least ti of fe — 2ti spins a! 2 ' 1 with "free" indices the value = — 1 in order to ensure that 
Yl n =i = 0- We could also attribute to a certain number t 2 > of pairs of the remaining 
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k — 3ti spins with "free" indices the values a ( n > = ±1. Thus by (4.2), (4.3) for j = 2 we know 
the values of dn^ for j = 2, 3, . . . , k — 1 for at least 3ti + 2i 2 indices n. We say that they 

(3) 

are "occupied" from j = 2 on. Among them <7„ = 1 for the number of indices ii + i 2 and 

(3) 

a^, = for the others 2t\ + t 2 . Then we should assign to the number t\ + i 2 of the remaining 
k — 3ii — 2i 2 spins a n ^ with "free" indices the value a n ^ = — 1 to make J^^ =1 ci 3 ^ = 0. 
We could also attribute to a certain number t-j > of pairs of the remaining k — 4ii — 3i 2 
spins the values ±1. Hence, after the third step, At\ + 3i 2 + 2i3 indices are "occupied" etc. 
Finally, after (j — l)th step, jt\ + (j — l)i 2 + . . . + 2t 3 _i indices are "occupied", = 1 for 
ii + ■ ■ ■ + among these indices, and at the jth step we must put = — 1 for the same 
number t± + i 2 + • • • + ij-i of "free" indices to ensure that ^n=i ^« ^ = 0- But) if ii > 1 or 
ti = 1 but ti > for some 2 < z < k — 1, then, for some j < k — 1, we have 



(In fact, for j = fc — 1, if, ii > 1, then obviously — (k — l)ti < ti , and if ti = 1 but U > we 
have k— (k — 1) — 2 < 1). This means that at the jth step there are not enough "free" indices 
among the remaining k — jti — (j — l)t 2 — ... — 2ij_i ones such that we could assign = — 1 
to ensure Yl n =i &™ ^ = ^- H ence ; the only possibility is ti = 1 and £ 2 = £3 = • • • = t^-i = 0. 

So, at the first step 2 indices get "occupied" and at each step one more index is "occupied" . 
Thus there exists a sequence of k different indices ni,n 2 ,... , n& € {1, . . . , /c} such that 
(Tn^ = 1, (Tn} +1 = —1, CTn '' = for n / n i5 n i+ i, i = 1, . . . , fc — 1. Solving the system (4.1), 
we see that A$ = A^ +1 = • • • = A„ i+1 _i = 1, = for n ^ m, . . . ,n i+1 - 1. Hence, 
the configuration a is a permutation of the configuration a 1 such that a n = i, iff a\. = i, 
i = l,... ,k. 

Let us now turn to the case I > 1. We use induction. Consider A;' -1 possible columns. We 
denote linear combinations of them by A&\ a = 1, . . . , k l_1 . Then for any i = 1, . . . , fc — 1, 
the following system should have a solution 



k - jh - {j - l)t 2 2t 3 -_i < ti + 1 2 + • ■ • + ij-i. 



A« - Af > + A« 
AW - A« + A« 




(4.4) 



ly a A k-2 ^ A k-1 ~ 

Yi Q A k-1 — (J I 
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It follows that 

ZA 1 - A 2 — CT 1)Q , - <T 2 ,a 

\W , i)\W \W _=.(*) 

-A x + /A 2 - A 3 — (T 2 - (7 3 , 

(4.5) 

\W i o\W _ =.(») =.(») 

Given cr^ — a 2 \, . . . , cr^l i a — * ms system (4-5) of k — 1 equations has a unique solution, 
which does not depend on a = 1, ... , fe i_1 . Then a^ a — d 2 \, . . . , a — should not 
depend on a neither. We denote by 5^ = — crj+ 1)Q! . 

Let us consider two cases. In the first case we assume that, for some i = 1, . . . , k — 1 
and for some j = 1,... ,k-l, 5f / 0. Then it may take values ±1, ±2. Knowing each 
of these values, we can reconstruct in a unique way a^ a = and cr^i Q = which 
do not depend on a. (If <5^ = 1, then = 1 and oj+i = 0, if 5^ = — 1, then aj 1 ' 1 = 
and crj+i = —1 etc.). Then we can reconstruct the values a^ a = + J2m=t^ f° r 
i = 1, . . . , j — 1, <7 t ^ = — Y^ m =j ^ for t = j + 1, . . . , fc, which consequently do not 
depend on a. Since the sum of all k l left-hand sides of equations (4.4) equals zero, it follows 
that ^ a ^J = i^> = 0- But > since = ^1 ^ follows that Ylj=i^j^ = °- Tnus ) 

A ^ = \ £j=i = i £}=i <f = for all a. ' 

The sequence <7^, . . . , being not constant and Ylj=i ^ = ^ follows that for some 
j u j 2l = 1 and = -1. Using (4.2) and (4.3), we see that af +1) = and af +1) = 1. 
Therefore, for some j = 1, . . . , fc — 1 <5j l+1 ^ 7^ 0, so that we may apply the previous reasoning 
to the configuration a^ +1 \ We get that the values c^ 1 ^ 1 ^ do not depend on a and that 
Aa +1 ^ = 0, for all a. Applying the analogues of (4.2) and (4.3) backwards, namely 

c#> = -1 a^" 1 ) = 0, 4^ 2 ) = 0, . . . , = 0, (4.6) 

c#> = 1 => a^" 1 ) = -1, <#- 2 > = 0, ...,<#> = 0, (4.7) 

we find that c^* _1 ' > = —1 and crj* -1 ' 1 = 0. Thus, for some j = 1, . . . , k— 1, ^j* -1 ' 1 / and so we 
may apply the previous reasoning to the configuration <j( l_1 ). Hence, crj* a ^ does not depend 
on a and Aa~^ = for all a. Continuing this reasoning subsequently for d^ l+2 \ . . . 
and backwards for a^~ 2 \ . . . , a^ l \ we derive that none of the values a^ a depends on a and 
that Aq - 1 = for all a and all s = 1, . . . , k. But the system A^ = for all s = 1, . . . , fc — 1 
and a = 1, . . . , /c' _1 has only the trivial solution. Hence the system (4.4) becomes the 
system (4.1). Invoking the reasoning for I = 1, we derive that o - is a permutation of the last 
configuration a 1 . 
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Let us now turn to the second case, that is assume that for all i,j 5^ = 0. Then the 
unique solution of (4.5) is = ■ ■ ■ = = 0. Then a[^ a = a^ a = ■■■ = a ( ^ a = for 
all a = 1, . . . , k l ~ x and all « = 1, . . . , k — 1. The system (4.4) is reduced to a smaller system 
h!^ = corresponding to the matrix C(a 1 , . . . , 0" i_1 ) with all k l ~ l different columns. The 
statement of the lemma holds for it by induction. Thus in this case a is a permutation of 
one of a 1 , . . . , a 1-1 . 

Proof of Lemma 3. 3. Let us remove from the matrix C(c 1 , . . . , a 1 ) linearly dependent columns 
and leave only r columns of the basis. They correspond to a certain subset of r configurations 
<r>>W j, (3 £ A r C {1, ... ,1} x {1, ... ,k — 1}, \A r \ = r. We denote by C r (a\... ,a l ) the 
N x r matrix composed by them. Then the probability in the right-hand side of (3.12) is 
not greater than the probability of the same events for j,/3 € A r only. Let f N {{tpj}), 
j,(3 G A r , be the characteristic function of the vector (2(N/k)vai X)~ 1 / 2 {Y l3 (a : >)}j^ € A r - 
Then 

F(v/9-l *-l,VJ-l t ' yV)l < ^ , ) 



< 



v / 2(7V/fe)varX S{k,N)^ 

I f - 1 l e it/3,j^S{k,N)T^ _ e -itf, tj c%S(k,N)i^: 



lim / /^--({^}) n ~- ^, 



(27r) ^°v D V J£ P ^ 

(4.8) 

To bound the integrand in (4.8) we use that 



<min(2cf < S(A:,iV)^,2(t /3)J )- 1 ). (4.9) 

Next, let us choose in the matrix C r (a x , . . . , <r z ) any r linearly independent rows and construct 
of them a r x r matrix C rxr . Then 

AT 

ij£- v ({fo»i =n Eex p( /oJ/m y ^ 1 .- 

n=i' ^y/2(N/k)vaxX t 

<frlEexpf ; %Xs {C rxr t± s ) (4.10) 

T 

< J] min (l, 2y/2(N/k)vai X({C rxr t\ s )-^ , 

s = l 

where f = {i/jjJj./JeA,.- Hence, the absolute value of the integral (4.8) is bounded by the sum 
of two terms 

SWt) ^/ J€ - 4 - (24) / n-NWHJtHC'il.)- 1 )^ 

||t||<S(fc,iV)*=T 
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+ 7^V / 11 ( 2 fe)" 1 )nmm(l,2V2(iVA)v a rX({C-^") s )- 1 )^ )J , 

||f||>S(fc,iV)*-i 

(4-11) i 

The change of variables ff = C rxr t in the first term shows that the integral over \\t\\ < S(k, N) 
is at most 0(N r / 2 (In S(k,N)T^) r ), where lnS(k,N)T^ = O(N) as N -► oo. Thus 
the first term of (4.11) is bounded by K 1 (C rxr , k, l)N 3r / 2 S(k, N)^ with some constant 
Ki(C rxr , k, I) > independent of N. Using the change of variables ff = S(k,N)'^ ri i in 
the second term of (4.11), one finds that the integral over \\t\\ > S(k,N) 1 ^ T is at most 
0(S(k,N)^). Thus (4.11) is not greater than K 2 (C rxr , k,l)N 3r / 2 S(k, N)t^ with some 
positive constant K 2 (C rxr , k, I) independent of N. 

To conclude the proof, let us recall the fact that there is a finite, i.e. N- independent, 
number of possibilities to construct the matrix C rxr starting from C(a 1 , ... ,a l ) since each 

2 

of its elements may take only three values ±1,0. Thus there exists less than 3 r different 
constants K 2 (C rxr , k,l) corresponding to different matrices C rxr . It remains to take the 
maximal one over them to get (3.12). 

Proof of Lemma 3.4- Throughout the proof we denote by DnUC{o-) the matrix completed 
by the rows , . . . , . 

Let us denote by c 1 ,... ,c q the system of columns of the matrix Djy. Then we can 
find the indices i\ < i 2 < ■ ■ ■ < ik-t-i < k — 1 such that a^ 3 ^ is a linear combination of 
c 1 ,... , c q , cr(* 2 ), . . . , £j^ s_1 ) for all s = 1, . . . ,k—t—l. Then there exist linear coefficients 
a\ (s) , . . . , dj s _ i (s) such that 

a^c 1 + ■ ■ ■ + a q (s)c q + a q+1 {s)a (1 '> + ■ ■ ■ + a ls . 1 (s)a {i3 - 1) =a {i °\ s = 1, . . . ,k-t-l. (4.12) 

(If r < q these coefficients may be not unique, but this is not relevant for the proof.) Since 
t > 1, without loss of generality (otherwise just make a permutation of spin values {1, . . . , k} 
in a) we may assume that i\ > 1. 

Initially each of k — t — 1 systems (4.12) consists of N linear equations. But the number 
of different rows of Dn being a fixed number R, each of these k — t — 1 systems (4.12) has 
only a finite number of different equations. Thus, (4.12) are equivalent to k — t — 1 finite (i.e. 
TV-independent) systems of different equations of the form: 

ai(s)di H h a q (s)d q = a q+1 (s)S 1 H h a is _i(s)5 is _i + S ia , (4.13) 

where d = (d\, . . . , d q ) is one of the R distinct rows of the matrix Dn and Sj = 0, 1, —1. 
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Note that there exist at most R x 3 s of such equations (4.13) for any s = 1, . . . ,k — t — 1. 
Consequently, for the given matrix Dm, there exists a finite (i.e. iV-independent) number of 
such sets of k — t — 1 finite systems of distinct equations (4.13). We will denote by A the set 
of such sets of k — t — 1 finite systems of distinct equations (4.13) which do arise from some 
choice of a spin configuration a with rank [D^ U C{a)\ = r + t, after the reduction of (4.12) 
(i.e. after eliminating the same equations among all N in each of k — t — 1 systems (4.12)). 
For a G Eat, we denote by a(a) G A the set of k — t — 1 finite systems of distinct equations 
(4.13) obtained from (4.12) in this way. 

We will prove that for any given element a £ A we have the estimate: 

#{a : rank[Av U C(a)] =r + t, a{a) = a} < C^tiV^l (4 . 14) 

where C is a constant that depends only on R, t, k. Since the cardinality of A is finite and 
depends only on R,t, and k, this will prove the lemma. 

Consider some «o G A. Since by definition of A there exists ao with the property rank[L>ArU 
C(<7o)] = r + 1 and a(<7o) = ceo, then there exists a solution of all these k — t — 1 systems of 
equations «o- Let ctj(s) be any such solution. For any row d = (di, . . . ,d q ) of D^, set 

A(s,d) = a 1 (s)d 1 + a 2 (s)d 2 H a q (s)d q . (4.15) 

Then to any row d of Dn there corresponds the vector of linear combinations A(d) = 
(A(l,d), A(2, d), . . . ,A(fc - 1 - t, d)). Next, let us divide the set V of the different rows 
of the matrix into m disjoint non-empty subsets Pi, V 2 , . . . , V m such that two rows d, d 
are in the same subset, if and only if A(d) = A(d). 

Lemma 4.1: The partition T>i defined above satisfies the following properties: 
(i) m> k — t 

(ii) For any pair d G Di, d 6 T>j, with i ^ j, and for any a, such that rank[Z)jv U C(<r)] = r + 1 
and a{a) = ao, the rows d and d can not be continued by the same row O of the matrix C{a) 
in D N U C(cr). 

Proof. Let us first show that V can be divided into three non-empty subsets T> , V\, V 2 , such 
that A(l,d) / -1,0 for all d G D , A(l,d) = -1 for d G Pi, A(l,d) = for d £ P 2 . First 
of all, since ao G ^4, then there exists at least one ao such that rank[L>Ar U C(a )] = r + t 
and a(o"o) = ao- Let d ,... ,d fc_1 denote k rows (not necessarily different) of that 
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are continued by the rows Oo, . . . , Ofc-i of the matrix C{<jq) (recall the definition given in 
the paragraph following (3.7)) respectively in Dm U C{gq). Now consider a row d %1 that is 
continued by the row 0^. The corresponding equation (4.12) with s = 1 then reads 

A(l,d il ) = -1. 

This shows that the set D\ / 0. Similarly, for a row d? continued by the row Oj with j > i\, 
the corresponding equation yields 

A(l,d j ) = 0. 

Thus P 2 7^ 0- Finally, consider the rows of the matrix continued by Oo, . . . , Oi 1 -\. The 
corresponding i\ equations (4.12) with s = 1 then read : 

A(d°,l) = -o,+i 
A(d\l) = a q+1 - a q+2 

A(d 2 ,l) = a q+2 -a q+3 (4.16) 

A(tf 1) = a q+il -x + 1. 

The sum of the right-hand sides of these equations equals 1. Thus the left-hand side of at 
least one equation must be positive. Hence, there exists d? with j G {1, . . . , i\ — 1} such that 

A(d>',l)^-1,0. 

Thus also Vq / 0, and so all three sets defined above are non-empty. Moreover, V 2 includes 
all rows d that are continued by the rows Oj with j > i\ of C(o"o). 

Now, let us divide T>2 into two non-empty subsets X>2,i, ^2,2 according to the value taken 
by A(2,d). We define P 2 ,i = {d G P 2 : A(2,d) / 0}, and P 2 ,2 = G P 2 : A(2,d) = 0}. 
Note that the row cP 2 is an element of P 2 by the observation made above, while using (4.12) 
with s = 2, we get, as before that A(2, d 12 ) = — 1, and for all j > i 2 , A(2, d J ) = 0. Thus £> 2j i 
and T>2,2 are non-empty. In addition to that, for any row d continued by Oj with j > i 2 we 
have again by (4.12) with s = 2 A(2, cP ) = 0. Hence, P 2) i and P 2j2 are non-empty, and P 2j2 
contains all rows d continued by Oj with j > i 2 of C{uq). 

Using (4.12) for s = 3 we can again split P 2j2 into two non-empty subsets P 2 , 2 ,i with 
A(d, 3) / and P 2j2j2 with A(d, 3) = 0. Furthermore, V 2 ^,2 contains all rows that are 
continued by Oj with j > i 3 of C(o"o), etc. The same procedure can be repeated up to the 
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step s = k — 1 — t— 1. In this way we have subdivided X>2 into k — l — t — 1 disjoint non-empty- 
subsets. Together with Vq and T>\, these constitute k — t disjoint subsets XV This proves 
the assertion (i). 

Let us now take any a such that rank [Dpi U C(a)] = r + t and with a(a) = a>o- Assume 
that d and d are continued by the same row Oj of C(a) in U C(cr). Since d and d belong 
to different subsets XV for some u G {1, . . . , k — t — 1}, A(d, u) / A(d, u). Then, writing 
(4.12) with s = u along the row d continued by Oj and along the row d continued by Oj we 
would get either the system 

A(d,u) = 
A(d, u) = 



if j > i u , or 



if j = i u , or 



A(d,it) = -1 
A(d,u) = -1 

A(d,u) — a q+ j{u) = 1 
A(d,u) — a q+ j{u) = 1 



if J ' = iu — 1 , or finally 



A(d,u) - o 9+3 -(«) + a g+J+ i(n) = 
A(d,u) - a q+ j(u) + a g+ j+i(n) = 0, 

if 3 < iu — 1- But no one of these four systems has a solution if A(d, u) / A(d, n). This proves 
(ii)- <> 

By (ii) of Lemma 4.1, for any a such that rank [D^UC(a)] = r+t and ct{a) = ao the set of 
rows of the matrix Dm is divided into m > k — t non-empty disjoint subsets T>i, . . . , V m and 
the set of k rows of the matrix C(a) is divided into m non-empty disjoint subsets Ci, . . . ,C m 
of cardinalities Si, . . . , s m > 1, respectively, such that the rows in continue the rows of 
only. But Sj rows of the matrix C(a) must be present Nsj/k times. Thus, first of all, in the 
matrix D N , these Tj rows must be present Nsj/k times as well, for all j = 1, . . . ,m. Thus, 
the number of configurations a with rank [Dn UC(a)] = r + t such that a(a) = Qo does not 
exceed Y[J =1 ( N ^/ k k ) ( N %i )/k ) ■ ■ ■ = ((N/k)\y k UT=i(Nsj/k)\ which is bounded by 

((N/k)l)- k ((N(k-m + l)/k)\)((N/k)l) m - 1 for any Sl) ... ,s m > 1 with Sl + ■ • • + s m = k. 

By (i) of Lemma 4.1 we have k — t < m < k, so that 

((N(k - m + l)//c)!)((A^//j)!) m " 1 _ (N{k -m+ l)/k\ (N{k - m)/k\ (N/k\ 



{(N/k)\) k V N/k J\ N/k J \N/k 
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N(t + l)/fc\ (Nt/k\ (N/k\ _ {N{t + l)/k)\ 
N/k )\N/k)"' \N/k) ~ ((N/k) !)(*+!) ' 



< 

Hence, for any matrix £)jv composed of R different columns 

#{cr : rank [D N U C(cr)] = r + i, a(cr) = a } 

'N(t + l)/k\ fNt/k\ (N/k\ (N{t + l)/k)\ 



, * v v n /7V(t + l)/fc\ /JVt/fc\ (N /k\ I 



m = k — t r ly ... ,r m >l, s lt ... ,e m >l 
r 1 -\ |-T m = -R s 1 -\ \- Srn = k 



({N/k)\) t + 1 ' 
(4.17) 







Proof of Lemma 3.5 The statement (3.29) is an immediate consequence of (3.28) and (3.18), 
(3.4) if 5 > is small enough. 

The proof of (3.27) and (3.28) mimics the standard proof of the Berry-Essen inequality. 

i 

Namely, we use the representation (3.8) of /^ 1 ''"' <T ({^}) as a product of N characteristic 
functions where at most k l of them are different. Each of them by standard Taylor expansion 



i ( I {^=i}- 1 {<=i+i})*/3-i) 
(too}) = 1 - i ° 1V "'-' A( , T n^ v var.V 



A(N/k)v&rX 



( 1 Wi=i}- 1 {ai=i+l}) t l3j) 

x I— 1 , . . . ,fe — 1 / 



-On J '" 1 '""' „ r~, ^ E(X -EXf = l- r n (4.18) 

6((2iV/£i)varX) 3 / 2 V ' K ' 

with |0 n | < 1. It follows that \r n \ < d^N' 1 + C 2 1 1 *1 1 3 iV~ 3 / 2 , for some d,C 2 > 0, all 
a 1 ,... ,a l , and all n. Then |r„| < 1/2 and \r n \ 2 < C 3 1 1 1 3 iV — 3 / 2 , for some C 3 > and all 
t satisfying \\t\\ < 5V~N, with 5 enough small. Thus, ln/^^"' CT ({tpj}) = —r n + 9 n r 2 l /2 
(using the expansion ln(l + z) = z + 9z 2 /2 for ||z|| < 1/2 with ||#|| < 1) , with some \6 n \ < 1 
for all a 1 ,... ,o~ l , all n, and all t satisfying < 5y/~N . It follows that ({tpj}) = 

exp(- En=i r n + En=i #n r n/ 2 )- Here ~ En=i r « = ~tB N (a 1 , ... , a l )t/2 + Yl^=i Pn where 
Kl < C 2 ||tl| 3 iV- 3 / 2 . Then 

fN'-^dtpj}) = e- tew(ffl "'^eEL^+W/ 2 ), (4.19) 

where |p n | + |fl n r2/2| < (a 2 + C 3 /2) ||tl| 3 iV- 3 / 2 . Hence | e S»=i (p " +g " r » /2) -l| < C 4 1 1 | 3 1 / 2 , 
for all f satisfying < eiV 1 / 6 with e > small enough. Moreover, | Yln=i(P™ + ^« r n/2)| < 
C 5 ||ij| 3 iV~ :l / 2 , which implies (3.28). This concludes the proof of Lemma 3.5 
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5. The unrestricted partioning problem. 

In the previous section we considered the state space of spin configurations where the 
number of spins taking each of k values is exactly N/k. Here we want to discuss what happens 
if all partitions are permitted. Naturally, we divide again the space of all configurations 
{1, . . . , k} N into equivalence classes obtained by permutations of spins. Thus our state space 
Sat has k N (kl) -1 elements. Let us define the random variables Y /3 (a) as in the previous 
section, see (1.4). Then we may state the following conjecture analogous to Theorem 1.1. 

Conjecture 5.1: Let 

V^a) = k^-^N-^k'/^kiy^-'K-^VSlY^a^ = 1,..., k-1. (5.1) 



Then the point process on 1 



,V k - 1 (cr)) 

converges to the Poisson point process on K+ _1 with the intensity measure which is the 
Lebesgue measure. 

Using Theorem 2.1, the assertion of the conjecture would be an immediate consequence of 
the following conjecture, that is the analogue of Proposition 3.1. 

Conjecture 5.2: Denote by Yla 1 o-'es^C') the sum over all possible ordered sequences of 
different elements o/Stv- Then for any I = 1,2,... , any constants > 0, j = 1, . . . , I, 
(3 = 1,... , k — 1 we have: 

E pfo - 1,... - iyj = i.... ,i j£{$L* K («)--/<*- C W-> ) 

= 1,... ,k-l 



Remark: One can notice the difference between the right-hand sides of (3.2) and (5.2). 
In spite of this difference, the proof of this statement proceeds along the same lines as 
that of Proposition 3.1. The only point that we were not able to complete is that the 
sum analogous to in (3.10) (recall that it is a sum over sets a 1 ,... ,o l such that the 
system {Y f3 (a : ')} 3 =i, is linearly dependent and, moreover, for any basis of this system 
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there exists a number j G {l,-- - ,1} such that for some non-empty subset of coordinates 
(3 G {1, . . . ,k — 1} the random variables V^((T J ') are included in this basis and for some 
non-empty subset of coordinates G {1, . . . , fe — 1} they are not included there) converges 
to as N — ► oo. Therefore the whole statement remains a conjecture. 

Remark: The case k = 2. In the case k = 2 the sum Jjy is absent. Hence, in this case 
we can provide an entire proof of (5.2) and therefore prove our conjecture. The result in the 
case k = 2 is not new: it has been already established by Ch. Borgs, J. Chayes and B. Pittel 
in [BCP], Theorem 2.8. Our Theorem 2.1 gives an alternative proof for it via (5.2). 

Finally we sketch the arguments that should lead to (5.2) and explain the differences with 
(3.2). To start with, similarly to (3.9), we split 

E p (-)+ E p (-)- ( 5 - 3 ) 

cr 1 ,... .cr'eSjv a 1 ,... ,a l €S N 

ranker 1 ,... ,^)={k-i)i rankcv 1 ,... ^'xtuji 

We are able to prove that the first part of (5.3) converges to the left-hand side of (5.2). For 
that purpose, we introduce again "the main part" of the state space with a 6 (0, 1/2): 

Hf N = jo- 1 ,... ,a l G Sat : VI < j < J, VI < i < r < 1, 1 < 0,7,77 < k,(3^ 7 

N N 

I E Ki=p - N / k \ < I Et 1 ^} - h<=,})M<=v}\ < N a VN} (5.4) 

n=l n=l 

where 

H^ivll > k Nl (l - exp(-hN 2a ))(k\)- 1 (5.5) 
and split the first term of (5.3) into two terms 

E p (-)+ E p (-)- ( 5 - 6 ) 

rankcf^ 1 ,... ,ai)=(*-i)i 

The second term of (5.6) converges to zero exponentially fast: the number of configurations 
in it is at most 0(exp(-hN 2a )k Nl ) by (5.5), while the probability P(-) = 0(N l k~ Nl ) by the 
analogue of Lemma 3.3. 

To treat the first term of (5.3), let us stress that an important difference compared to 
the previous sections is the fact that the variables Y l3 (a) are now not necessarily centered. 
Namely, 

N 

EY?(a) = (EX) Et 1 !^} " *{* n =f>+i}) = KX \#i n : cr n = f3] - #{n : a n = f3 + 1}] 

71 = 1 

(5.7) 
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as it may happen that #{n : a n = (3} ^ #{n : a n = (3 + 1}. 

Taking this observation into account and proceeding similarly to the analysis of (3.21), we 
can show that, uniformly for all a 1 , . . . , a 1 G TZf N , 

k- N \k\) l \[,J2cP) ( EYf B- 1 EYf x 
p (0 = i-nf/ - expf- j=== =- —=4==) +0 {k- m ) (5.8) 



( 27r )(fc-i)i/2 ^/2{N/k)vaxX 2 ^2(iV/£;)varX/ 

where the matrix B consists of I diagonal blocks (k — 1) x (k — 1), each block having 1 on the 
diagonal, —1/2 on the line under the diagonal and everywhere else. Thus the first term of 
(5.6) by (5.8) and (5.5) equals 

ggl^ , EYf B-i EYf , 

^ . a (2vr)( fc - 1 )V2 P V ^/2(N/k)v&iX 2 ^(N/^varX <> KJ 

n,-,g(2cf) / EYf g-i \ 

" (27r)^-W2^.-^ ex n V2(^/^)varX 2 ,/2{N/k)vaiX ) ° { j " 

(5.9) 

By the Central Limit Theorem the vector X^n=i(^-{ (T J '=/3} ~~ 2{ ct j =/j+i} ) / \/ 2N/k on con- 
verges to a Gaussian vector Zj 3 with zero mean and covariance matrix B as N — > oo. Hence, 
(5.9) converges to 

(27r)(fe -W2^ ex Pl 2 *j ~ 11 ^ {EX) 2 +vaiX ( 5 - 10 ) 

which is the right-hand side of (5.2). This finishes the analysis of the first term of (5.3). 

To treat the second term, we split it into two parts and analogously to (3.10). 
The analysis of J\ is exactly the same as in the proof of Proposition 3.1 and relies on 
Lemmatas 3.3 and 3.2. 

However, the problem with the sum jjj persists. First of all, this sum contains much 
more terms than in the case of the previous section as it consists essentially of configurations 
cr 1 , . . . , a 1 where some of the values of spins (3 among {1, . . . , k} figure out more often than 
others, i.e. #{n : a n = (3} > #{n : a n = (3 + 1}. Lemma 3.4 is not valid anymore. Second, 
for all such configurations a, the random variables Y l3 (a) are not centered and consequently 
the estimate of the probability P(-) suggested by Lemma 3.3 is too rough. We did not manage 
to complete the details of this analysis. 
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